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Abstract 

Let b be an integer greater than 1 and let W e — (W^ ; n > 0) be a random walk on the 
6-ary rooted tree U&, starting at the root, going up (resp. down) with probability 1/2 + e 
(resp. 1/2 — e) , e G (0,1/2), and choosing direction i G {1, ...,6} when going up with 
probability Oj. Here a = (ax, . . . , a^) stands for some non-degenerated fixed set of weights. 
We consider the range {W^;n > 0} that is a subtree of It corresponds to a unique 
random rooted ordered tree that we denote by r e . We rescale the edges of r e by a factor e 
and we let e go to 0: we prove that correlations due to frequent backtracking of the random 
walk only give rise to a deterministic phenomenon taken into account by a positive factor 
7(a). More precisely, we prove that r e converges to a continuum random tree encoded by 
two independent Brownian motions with drift conditioned to stay positive and scaled in time 
by 7(a). We actually state the result in the more general case of a random walk on a tree 
with an infinite number of branches at each node (b = 00) and for a general set of weights 
a = (a n ,n > 0). 

MSC 2000 subject classifications: 60F17, 60J80, 05C05, 05C80. 
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1 Introduction. 



Random walks on trees have been intensively studied by many authors having different moti- 
vations coming from group theory, discrete potential theory, statistical mechanics or genetics. 
We refer to the book of W. Woess [21] for a general introduction to random walks on infinite 
graphs and to the book of R. Lyons and Y. Peres [13] for a probabilistic approach more focused 
on trees. See also [12] for a survey of open problems concerning random walks on trees. In 
most of the papers about random walks on trees, given the tree-like environment the transition 
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probabilities of the random walk are fixed and one focuses on a certain range of questions: the 
speed of the random walk (see [18] for random walks on groups, [20] for random walks on periodic 
trees, [14] and [15], for random walks on Galton- Watson trees), large deviation principle for the 
distance- from-the-root process (see [8] for random walks on Galton- Watson trees), central-limit 
theorem for the distance-from-the-root process and the number of visited vertices (see [4] for the 
fr-ary tree and [17] for the simple random walk on supercritical Galton- Watson trees). In this 
paper, we consider a different problem; the transition probabilities are not fixed: we study, near 
criticality, transient random walks on the 6-ary rooted tree and more generally on the oo-ary 
tree, in a "diffusive" regime. 

Let us specify that we only consider ordered rooted trees that are formally defined as in [16]: 
Let N = {0, 1, 2, . . .} be the set of the nonnegative integers, set N* = N \ {0}. The oo-ary tree is 
the set U = {0} U Un>i(^*)™ °f the finite words written with positive integers by. Let u G U be 
the word u± . . . u n , U{ G N*. We denote the length of u by \u\ : \u\ = n. \u\ is viewed as the height 
of the vertex u in U. Let v = v \ . . . v rn G U. Then the word uv stands for the concatenation of 
u and v: uv = u\ . . . u n v\ . . . v m . Observe that U is totally ordered by the lexicographical order 
denoted by < . A rooted ordered tree t is a subset of U satisfying the following conditions 

(i) G t and is called the root of t. 

(ii) If v G t and if v = uj for some j € N*, then, u G t. 

(iii) For every u G t, there exists k u (t) > such that uj G t for every 1 < j < k u (t). 

We denote by T the set of ordered rooted trees. Let us mention that we sometimes see ordered 
rooted trees as family trees. So, we often use the genealogical terminology instead of the graph- 
theoretical one. All the random objects introduced in this paper are defined on an underlying 
probability space denoted by (f2,jF,P). Let e G (0,1/2) and let a = (a n ,n > 1) be some 
non-degenerated fixed set of weights, namely ^a ra = 1 and < a n < 1 , n > 1. 

We attach to the infinite tree U a cemetery point d ^ U situated at height (—1) and we view 
d as the parent of the root 0. Then, we let run a particle on U U {d} that evolves as follows: 

• The particle starts at at time and it stops when it reaches d. 

• If at time n the particle is at vertex v G U, then it jumps down to the parent of v with 
probability 1/2 — e and it goes up with probability 1/2 + e. 

• When going up, the particle chooses direction j G N* and jumps to the vertex vj G U with 
probability aj. 

The height of the particle evolving in U U {d} is then distributed as a random walk on Z 
started at 0, stopped when reaching state —1, and whose possible jumps are (+1) with probability 
1/2 + e and (—1) with probability 1/2 — e. In this paper we condition the particle to never 
reach d (observe that this conditioning is non singular). We denote by W e = (W^ ; n > 0) the 
sequence of vertices in U visited by the conditioned particle. 
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We study the range {W* ; n > 0} when e goes to zero. Observe that it is an ordered rooted 
subtree of U. There exists a unique ordered rooted tree r e € T corresponding to {W* ; n > 0} via 
a one-to-one map that fixes the root 0, preserves adjacency and that is increasing with respect 
to the lexicographical order. 

Since W e goes to infinity, r e has one single infinite line of descent. Following Aldous's termi- 
nology introduced in [2] we call sin-tree such trees (see Section 2.1 for precise definitions). The 
distribution of r e is not simple and it shows correlations due to frequent backtraking of the ran- 
dom walk (see comments in Section 2.3). However Theorem 2.1, which is the main result of the 
paper, asserts that r e converges in distribution to some continuum random tree. More precisely, 
think of T e as a planar graph embedded in the clockwise oriented half-plane and suppose that 
its edges have length one; consider a particle visiting continuously the edges of r e at speed one 
from the left to the right, going backward as less as possible; we denote by C s (r e ) the distance 
from the root of the particle at time s and we call the resulting process C(r e ) = (C s (r e ); s > 0) 
the left contour process of r e . It is clear that the particle never reaches the part of r e at the 
right hand of the infinite line of descent; observe however that C(r e ) completely encodes the 
left part of r e . Denote by C'(t £ ) the process corresponding to a particle visiting r e from the 
right to the left. Thus, (C (r t ) , C* (r t )) completely encodes r e (see Section 2.2 for more careful 
definitions and other encodings of sin-trees). Let D and D' be two independent copies of the 
process s — > B s — 2s — 2inf r < s (l? r — 2r) where B is distributed as the standard linear Brownian 
motion started at 0. Theorem 2.1 asserts that the following convergence 



holds in distribution in C([0, oo), M 2 ) endowed with the topology of uniform convergence on 
compact sets. We see that correlations in T t only give rise to a deterministic phenomenon 
characterized by a constant 7 = 7(a) that is defined by 



where (X n ;n > 1) stands for a sequence of i.i.d. {a n , n > l}-valued random variables whose 
distribution is given by P(X n = ai) = Yl cij, the sum being taken over the j's such that dj = a^. 
Observe that if b is some integer greater than 1 and if a n = for all n > b + 1, then the particle 
remains in the 6-ary ordered rooted tree = {0} U Un>i{l> • • • > b} n ■ More comments about 
this limit theorem are added before and after the statement of Theorem 2.1. 

Before ending this section, let us give a short overview of the proof of the theorem: one 
part of the proof relies on a specific encoding of the range {W„ ; n > 0} that can be explained as 
follows: Denote by (| W*| ; n > 0) the sequence of successive heights of the particle. It is obviously 
distributed as a random walk started at whose possible jumps are (+1) with probability 1/2 + e 
and (—1) with probability 1/2 — e, conditioned to stay nonnegative. Then, the piecewise linear 
process 




I/7 = E (1 + X l + X t X 2 + X x X 2 Xj, + ...) 
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is the contour process of an infinite "Active" tree denoted by r e whose distribution can be infor- 
mally described as follows: T t has one infinite line of descent; at each vertex v on the infinite line 
of descent an independent random number with distribution n of independent Galton- Watson 
trees with offspring distribution fx is attached at the left of the infinite line. Here, \i stands for 
the probability measure on N given by n(k) = (l/2 + e)(l/2-e) fc , k > (see Section 2 for precise 
definitions concerning trees and Lemma 3.1 for the details). 

We then encode the walk (W^ ; n > 0) by the tree T € and random marks Ji u G N* , u G f e 
that are defined as follows: Let u G r e be distinct from the root 0. Denote *u its parent. 
By definition of the contour process the edge ( u , u) corresponds to a unique upcrossing of the 
process (|W^| ;n > 0) between times n(u) and n(u) + 1. Thus, there exists j G N* such that the 
word W£, u \ +1 is written W^,sj and we set ~fi u = j. Then, we easily check that conditional on 
r e , the marks Ji u , u G T t \ {0} are independent and distributed on N* in accordance with a (see 
Section 3.1 for details). We get back the walk W e from the marked tree T t = (f e ; (~fi u , u G r e )), 
in the following way: consider u G r e , distinct from the root at height \u\ = n; denote by 
uq = , u\ , . . ., u n = u the ancestors of u listed in the genealogical order. Then we define 
the track of u: Trg= e («) by the word ~fi Ul . . - Ji Un G U (observe that the mark of the root plays no 
role). Then, 

and thus 

Tr Te (r e ) = 0^;n>0}. 

Taking the trace of r e has two distinct effects: the first one shuffles T t in the order of the marks 
in N*. The second one shrinks the tree because several edges of T € might correspond to the same 
vertex in U. 

Let us briefly explains how to deal with the shuffling effect of the tree: it is possible to reorder 
randomly the marked tree T e into a new marked tree T e = (r e ; (jl u ,u G r e )) such that: 

• % has the same distribution as the tree obtained from r e by changing independently and 
uniformly at random the order of birth of brothers in r e . 

• If u\ , U2 G T e are such that u\ < U2 then 

Tr fe ( Ul ) <Tr ft (u 2 ) 

(see Section 3.1 for a precise definition). Thus, the shuffled tree r e has a simple distribution 
specified by Remark 3.1. Up to the shrinking effect, r e is close to r e and if we denote by 
(C(r e ), C'(r e )) the left and the right contour processes of r e we prove in Section 2.1 that 

(eC s/e ,(r e ) , eC' s/e2 (r e )) (2D S , 2D* S ) S>0 , (2) 

i • distribution in C([0, oo), M 2 ) endowed with the topology of uniform convergence on compact 
sets. Denote by d the graph distance in U. Informally speaking, (2) says that the metric space 
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(r e , e . d) converges to some random metric space that is a continuum random trees encoded by D 
and D* (see Comment (c) after Theorem 2.1 for a more precise discussion of that point). Next, 
observe that for any U\ and Ui in % 

< d(«i,«2) -d (T^ui^T^ua)) < 2G(izi,u 2 ) 

where conditional on u±,U2 € r e , G(u±,U2) is a random integer with a geometric distribution 
with parameter q = P(Xl ^ -X2), which is a quantity that does not depend on e. This gives an 
informal argument to explain why the limits of the metric spaces (r e ,e.d) and (r e ,e.d) should 
be close and why tightness for the contour processes of r e is not the difficult part of the proof: 
it is deduced from (2) by (now standard) arguments inspired from the proof of Theorem 20 in 
Aldous's paper [3]. 

The technical point of the paper concerns the identification of the limiting tree by study- 
ing precisely the "shrinking effect" via explicit computations for the U-indexed Markov process 
(Z v ,v £ U) given by 

Z v = #{u£t 6 : Tr fc (u)=v}. 

This analysis is done in Proposition 3.3 and in Proposition 3.4. More precisely, if we fix a real 
number x > and if we remove from r e all the descendents of the unique vertex at height [x/e] 
on the infinite line of descent, we get a finite tree denoted by rf . Let U be a uniform random 
variable in [0, 1] independent of r e . Denote by U(e) the vertex of rf coming in the J-th 
position in the lexicographical order. Set 

V(e) = Wo}" 
v<Tr % (U(e)) 

Then, we prove that 

in probability. This key result is stated more precisely in Lemma 3.7. 

The paper is organized as follows: In Sections 2.1 and 2.2 we specify our notations and we 
define various encodings of trees and forests; Theorem 2.1 is stated at Section 2.3; Section 3 is 
devoted to its proof that relies on a certain combinatorial representation of the range {W^, n > 0} 
given at Section 3.1 and on a technical estimate (Lemma 3.7) whose proof is postponed at Section 
3.3 while the proof of Theorem 2.1 itself is done at Section 3.2. 

2 Preliminaries and definitions 

2.1 Trees, forests and sin-trees 

We first start with some notations. We define on U the genealogical order =<! by 

Vu, v G U , u ^ v 3w G U : v = uw. 
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If u =<! v, we say that u is an ancestor of v. If u is distinct from the root, it has an unique 
predecessor with respect to ^ that is called its parent and that is denoted by \t. We define the 
youngest common ancestor of u and v by the ^-maximal element w G U such that w =4 u and 
w =<! v and we denote it by u A v. We also define the distance between u and v by d(u, v) = 
\u\ + \v\ — 2\u A v\ and we use notation [[it,i>]] for the shortest path between u and v. Let t G T 
and u £ t. We define the tree t shifted at u by 9 u {t) = {uGD : to 6 and we denote by [t] u 
the tree t cut at the node u : [t] u := {u} U {v £ t : v Au ^ u}. Observe that [t] u G T. For any 
ui, . . . , u k G t we also set [i] ul ,..., Ujt := (~1 . . . n [t] Uk and 

[An = [t]{uet: \u\=n) = {u <E t : \u\ < n} , n>0. 

Let us denote by Q the cr-field on T generated by the sets {t £ T : u G t} , u G U and 
let /i be a probability distribution on N. We call Galton- Watson tree with offspring distribu- 
tion /j, (a GW(/x)-tree for short) any (J 7 , (y)-measurable random variable r whose distribution is 
characterized by the two following conditions: 

(i) P(fc (r) = *) = , * > o. 

(ii) For every i > 1 such that ,u(i) 7^ 0, the shifted trees 0i(t), . . . ,0i(r) under 
P(- I k (r) = i) are independent copies of r under P. 

Remark 2.1 Let u\,...,Uk G U such that Uj A Uj ^ {ni, . . . , Uk} , 1 < i,j < k, and let r 
be a GW(//)-tree. Then, conditional on the event {ui, . . . ,Uk G r}, 9 ui (t), . . . ,0 Ufc (r) are i.i.d. 
GW(/i)-trees independent of [r] uli ... ;Ufc . 

We often consider a forest (i.e. a sequence of trees) instead of a single tree. More precisely, 
we define the forest / associated with the sequence of trees (ij ; Z > 1) by the set 

/ = {(-l,0)}uJ{(/,n), uG^} 
l>i 

and we denote by F the set of forests. Vertex (—1,0) is viewed as a Active root situated at 
generation —1. Let u' = (l,u) G / with I > 1; the height of v! is defined by \u!\ := \u\ and its 
ancestor is defined by (I, 0). For convenience, we denote it by 0/ := (/, 0). As already specified, 
all the ancestors 0i,02, •■• are the descendants of (—1,0) and are situated at generation 0. 
Most of the notations concerning trees extend to forests: The lexicographical order < is defined 
on / by taking first the individuals of t\, next those of t?. ... etc and leaving (—1, 0) unordered. 
The genealogical order ^ on / is defined tree by tree in an obvious way. Let v' G /. The 
youngest common ancestor of v! and v' is then defined as the ^-maximal element of w' such 
that w' ^ v! and w' =4 v' and we keep denoting it by u' A v' . The number of children of u' is 
k u '(f) '■= k u (ti) and the forest / shifted at v! is defined as the tree 9 u '(f) '■= 9u(ti)- We also 
define [f\ u > as the forest {u'}L){v' G / : v' Au' 7^ u'} and we extend notations [/^...y and [f] n 
in an obvious way. For convenience of notation, we often identify / with the sequence (ti ; I > 1). 
When (t\ ; I > 1) = (t±, . . . , 0,0,...), we say that / is a finite forest with k elements and we 
abusively write / = (ii, ■ ■ ■ , tfc). 
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We define the set of sin-tree by 

T sin = {t G T : Vn > 0, # {v G t : \v\ = n and #9 v (t) = 00} = 1 } . 

Let t G T sin . For any n > 0, we denote by u*(t) the unique individual u on the infinite line 
of descent (i.e. such that #0 u (t) = 00) situated at height n. Observe that Ug(i) = 0. We 
use notation £oo(t) = n > 0} for the infinite line of descent of t and we denote by 

(l n (t) ; n > 1) the sequence of positive integers such that is the word l\{t) . . . l n (t) G U. We 

also introduce the set of sin-forests F s j n that is defined as the set of forests / = (ti ; I > 1) such 
that all the trees ti are finite except one sin-tree ti . We extend to sin- forests notations u*, and 
Z„ by setting l n (f) = l n (ti ), <(/) = (/ ,<(i/ ) ) and itg(/) = 0; o . 

Next, we introduce a natural class of random sin-trees called Galton-Watson trees with im- 
migration (GWI-trees for short). The distribution of a GWI-tree is characterized by 

• its offspring distribution fx on N that we suppose critical or subcritical: = X^fcxj^M^) ^ 1; 

• its dispatching distribution r defined on the first octant {(k, I) G N* x N* : 1 < I < k} that 
prescribes the distribution of the number of immigrants and their positions with respect to the 
infinite line of descent. 

More precisely, r is a GWI(/x, r)-tree if it satisfies the two following conditions: 

(i) The sequence S = ( (k u * ( t )(t), l n+ i(r)) ; n > 0) is i.i.d. with distribution r. 

(ii) Conditional on S, the trees u * n (T)i ( r ) with n G N and 1 < i < k u *^(r) with i 7^ Z n +i(r) 
are mutually independent GW(//)-trees. 

We define a GWI(/x, r)-forest with I > 1 elements by the forest if = (r, Ti, . . . , t/_i) where the 
Tj's are i.i.d. GW(^)-trees independent of the GWI(/x, r)-tree r. It will be sometimes convenient 
to insert r at random in the sequence (t±, . . . ,t/_i) but unless otherwise specified the random 
sin-tree in a random sin-forest occupies the first row. 

The word "immigration" comes from the following obvious observation: Let ^bea GWI(/x, re- 
forest with 1 + 1 elements. Set for any n > , Z n (<p) = #{u G tp : \u\ = n} — 1. Then the process 
(Z n (<p);n > 0) is a Galton-Watson process with immigration started at state I, with offspring 
distribution fj, and immigration distribution v given by 

v(k)= r(k+l,j), k>0. 
i<i<fe+i 

Recall that a Galton-Watson process with immigration (Z n (tp);n > 0) is a N- valued Markov 
chain whose transition probabilities are characterized by 

= fn(x) Z ^g (fn-i(x)) g (/„_ 2 (a0) • • • g(fo(x)), (3) 

where / (resp. g) stands for the generating function of /j (resp. v) and where f n is recursively 
defined by f n = / n _i o / , n > 1 and f = Id. 

We conclude this section by giving an elementary result on the so called GW(/j)- size-biased 
trees that are GWI(/j, r)-trees with dispatching distribution of the form r(k,j) = /j,(k)/p,, 1 < 
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j < k. Size-biased trees arise naturally by conditioning critical or subcritical GW-trees on non- 
extinction: see [1], [2], [9] or [11] for related results. The term "size-biased" can be justified by 
the following elementary result needed at Section 3.3: Let <p be a random forest corresponding 
to a sequence of I independent GW(/i)-trees and let (p\, be a GWI(/x, r)-forest with I elements 
where r is taken as above and where the position of the unique random sin-tree in (p^ is picked 
uniformly at random among the I possible choices. Check that for any nonnegative measurable 
functional G on F x U: 



and in particular dP([<^]n G ■)/dP([^] n G •) = Z n (ip)/lfj, n . 
2.2 The encoding of sin-trees 

The purpose of the paper is to provide a limit theorem for r e thanks to its encoding by two 
contour processes as briefly explained at the introduction. It will be convenient to introduce two 
additional encoding processes: namely the height process (also called exploration process) and a 
certain kind of random walk. 

Encoding of finite trees and forests. Let t G T be a finite tree and let uq = < u\ < . . . < 

u# t ~i be the vertices of t listed in the lexicographical order. We define the height process of t by 
H n (t) = \u n \, < n < #t. H(t) clearly characterizes the tree t. 

We also encode t by its contour process which is informally defined as follows: think of t as 
a graph embedded in the clockwise oriented half-plane with unit length edges; let run a particle 
starting at the root at time that explores t from the left to the right moving continuously 
along each edge at unit speed until it comes back to its starting point. In this evolution, each 
edge is crossed twice and the total amount of time needed to explore the tree is thus 2(#t — 1). 
The contour process C(t) = (C s (t); < s < 2(#i — 1)) is defined as the distance-from-the- 
root process of the particle at time s G [0, 2(#t — 1)]. More precisely, C(t) can be recovered 
from the height process by the following transform: Set b n = 2n — H n (t) for < n < #t and 
b #t = 2(#t - 1). Then observe that 



We also need to encode t in a third way by a path V{t) = (V n (t); < n < #t) that is defined 
by V n+ \(t) = V n (t) + k Un (t) — 1 and Vo(t) = 0. V{t) is sometimes called the Lukaciewicz path 
associated with t. It is clear that we can reconstruct t from V(t). Observe that the jumps of 
V(t) are > -1. Moreover V n {t) > for any < n < #t and V #t (t) = -1. We recall from [10] 
without proof the following formula that allows to write the height process as a functional of 




C s (t) 



H n (t) - s + b n if s G [b n , b n+ i - 1) and n < #t - 1 

s — o n +i + H n+1 (t) if s G [b 

n+i — 1> &n+i] and n < #t — 1 
H# t -i (t)-s + 6 #t _i if s G [b#t-i , b #t ] . 



(5) 



V(t): 




(6) 



8 



T. Duquesne. 



Limit of random walks on trees. 



Remark 2.2 If r is a critical or subcritical GW(/x)-tree, then it is clear from our definition that 
V(t) is a random walk started at that is stopped at —1 and whose jump distribution is given 
by p(k) = fi(k + 1) , k > —1. However neither H(t) nor C(r) are Markov processes except for 
the geometric case: H(k) = (1 - p)p k with p € (0, 1/2]. In this case, C(ip) is distributed as a 
random walk killed at —1 and whose possible jumps are (+1) with probability p and (—1) with 
probability 1 — p (more precisely it is the restriction of the first T_i — 1 steps of a random walks 
killed at the reaching time of level (— 1)). 

The previous definition of V and of the height process can be easily extended to a forest 
/ = (U ; I > 1) of finite trees as follows: Since all the trees t\ are finite, it is possible to list all the 
vertices of / but (—1,0) in the lexicographical order: uq = 0± < u± < . . . etc. We then simply 
define the height process of / by H n (f) = \u n \ and V(f) by V n+1 (f) = V n (f) + k Un (f) - 1 with 
Vo(f) = 0- Set n p = #ti + . . . + #t p and no = and observe that 

H np +k(f) = H k (t p+1 ) and V rip+k {f) = V k (t p+1 ) - p , < k < #t p+1 ,p>0. 

We thus see that the height process of / is the concatenation of the height processes of the trees 
composing /. Moreover the n-th visited vertex u n is in t p iff p = 1 — inf <fc< n V k (f). Then, it is 
easy to check that (6) remains true for every n > when H(t) and V(t) are replaced by resp. 
H(f) and V(f). 

Encodings of sin-trees. Let t £ T s i n . A particle visiting t in the lexicographical order never 
reaches the part of t at the right hand of the infinite line of descent. So we need two height 
processes or equivalently two contour processes to encode t. More precisely, the left part of t is 
the set {u £ t : 3v £ £oo{t) s.t. u < v}. It can be listed in a lexicographically increasing sequence 
of individuals denoted by = uo < u\ < . . .. We simply define the left height process of t by 
H n (t) = \u n \ , n > 0. H{t) completely encodes the left part of t. To encode the right part we 
consider the " mirror image" t* of t. More precisely, let v G t be the word C1C2 ■ • • c n . For any 
j < n, denote by Vj := C\ . . . Cj the j-th ancestor of v with vo = 0. Set c* = A;^_ 1 (t) — Cj + 1 
and v' = c\ . . . c* . We then define t* as {v°, v G t} and we define the right height process of t as 
H'(t):=H(f). 

Remark 2.3 Observe that r and r* have the same distribution if r is a GW(/x)-tree. This is 
not anymore the case if r is a GWI(ju, r)-tree unless r(k, m) = r(k, k — m + 1). 

We now give a decomposition of H{t) and H'(t) along £oo(t) that is well suited to GWI-trees and 
that is used at Section 3.2: Recall that (u n ;n > 0) stands for the sequence of vertices of the left 
part oft listed in the lexicographical order. Let us consider the set {w*_ 1 (i)i; 1 < i < l n {t); n > 1} 
of individuals at the left hand of £oo(t) having a brother on £oo(t). To avoid trivialities, we assume 
that this set is not empty and we denote by v± < V2 < ■ ■ ■ the (possibly finite) sequence of its 
elements listed in the lexicographical order. 

The forest f(t) = (0 Vl (i), 9 V2 (t), . . .) is then composed of the bushes rooted at the left hand of 
£oo(t) taken in the lexicographical order of their roots. Define L n (t) := (li(t) — l) + ... + (l n (t) — l) 
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, n > 1 with Lo(t) = and consider the p-th individual of f(t) with respect to the lexicographical 
order on f(t); check that the corresponding bush is rooted in t at height 

a{p) = inf{fc > : L k {t)>\- inf Vj( f(t) )} . 

Thus the corresponding individual in t is u n ( p ) where n(p) is given by 

n(p)=p + a{p) (7) 

(note that the first individual of f(t) is labelled by 0). Conversely, let us consider u n that 
is the n-th individual of the left part of t with respect to the lexicographical order on t. Set 
p( n ) = < n : Uk ^ ioo(t)} that is the number of individuals coming before u n and not 
belonging to £oo(t). Then 

p(n) = mf{p > : n(p) > n}. (8) 
and the desired decomposition follows: 

H n (t)=n-p(n)+H p(n) (f(t)). (9) 

Since n — p(n) = #{0 < k < n : Uk £ ^oo(^)}, we also get 

a(p(n) - 1) < n - p(n) < a(p(n)). (10) 

Observe that if w n ^ too{i), then n — p(n) = a(p(n)). The proofs of these identities follow from 
simple counting arguments and they are left to the reader (see Figure (1)). Similar formulas hold 
for H'(t) taking t* instead of t in (7), (8), (9) and (10). 

Remark 2.4 The latter decomposition is particularly usefull when we consider a GWI(/J,r)- 
tree r: In this case (f(r), f(r')) is independent of (L(r), L(r*)), /(r) and /(r*) are mutually 
independent and /(r) (resp. /(r*)) is a forest of i.i.d. GW(/x)-trees if for some k > 2 we have 
r(/c, 2) + . . . + r(/c, fc) ^ (resp. r(/c, A; — 1) + ... + r(k, 1) 7^ 0), it is otherwise an empty forest. 
Moreover, the process (L(r), L(j')) is a N x N-valued random walk whose jump distribution is 
given by 

P (L n+ i(r) - L„(t) = m ; L n+ i(r*) - L n (r # ) = m) = r(m + m' + 1, m + 1). 

We next define the left contour process of the sin-tree t denoted by C(t) as the distance- from- 
the-root process of a particle starting at the root and moving clockwise on t viewed as a planar 
graph embedded in the oriented half plane with edges of unit length. We define C'(t) as the 
contour process corresponding to the anticlockwise journey and we can also write C{t*) = C'(t). 
More precisely, C(t) (resp. C'(t)) can be recovered from H(t) (resp. H'(t)) through (5) that 
still holds for sin-trees (note that in that case the sequence (6 n ;n > 0) is infinite). 

It will be sometimes convenient to approximate a sin-tree t by the finite tree [i] u *(t) with 
n large. The formula connecting the contour processes of t and [i] u *(t) is given as follows: Set 
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10 , 7* 



14 , 10* 



\ 



\ 



2 , 1* 



3 , 2* 



1 , 0* 




15 







Figure 1: The left part of a sin-tree t. The individuals which are not on £oo(t) have two labels: the first 
one is their row in the lexicographical order on t and the second one (taged with a star) corresponds to 
their row in f(t); individuals of £oo(t) have only one label corresponding to their row in t. 



<7n(t) = #{u et : u < <(t)} and a n (f) = #{u 6 t* : u < <(*•)}■ We get a n (t) + a n {f) = 
#[t] u *(t) + n — 1 since the individuals of [[0, ^_i(i)]] have been counted twice. Check that 

a n (t) = sup{/c > : H k (t) < n] and 2a n (t) - n = sup{s > : C s (t) < n}, (11) 

with similar formulas for t*. Thus we get 



(Observe that a similar formula is not available for height processes.) 
2.3 Statement of the main result 

For convenience of notation, we set d = 1/2 — e and u = 1/2 + e. Recall that r e G T denotes the 
random ordered rooted tree associated with the range of the random walk W e in U. First observe 
that the process (|W^|;n > 0) giving the distance from the root of the particle performing the 
random walk does contain an important part of the information concerning r e . Moreover, this 




(12) 
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process is simply distributed as the post-infimum path of a random walk whose possible jumps 
are +1 with probability u and —1 with probability d. Recall that (B s ;s > 0) stands for the 
linear Brownian motion and set for any y G R, B^ = B s +ys and 1^ = inf u < s B$\ Standard 
arguments imply 



W, 



°)7S*«-e»>o) 



where we have set g = inf{s > : B^ = 1^}. Notation stands for the convergence in 
distribution in the appropriate space of right-continuous functions with left limits endowed with 

Skorohod topology. We also use notation ^— > for the convergence in distribution of all finite 
dimensional marginals. 

This result turns out to provide the right scaling for r e though the connection between 
(|W^| ;n > 0) and r e is non-trivial and the distribution of r e is not simple: for instance, we 
can check that r e and r* might not have the same distribution. Take the binary case a = 
(a, 1 - a, 0,0,...) for some a G (0, 1). Define the set A C T by A = {t G T : k (t) = 2, k^t) = 
0, k,2{t) > 0}. Then it follows from simple arguments discussed in Section 3.1 that 

P(r £ G A) = , ^ ~ a j , and P(r* E A) — du*a{l-a) 



(u + da)(u + d 2 a) Ve ; (u + d(l - a))(u + d 2 (l - a)) ' 

Thus, except for a = 1/2, P(r e G A) ^ P(t* G A). Actually, when e goes to zero, the particle 
backtracks more and more often causing correlations. However, Theorem 2.1 asserts that the 
correlations only give rise to a deterministic phenomenon that is taken into account by the 
coefficient 7 = 7(a) given by (1). 

Theorem 2.1 Let D and D* be two independent copies of B^~ 2 ^ — 2I^ 2 \ Then, 
« (eC a/ Are) , " C 's/Are)) s>0 {2D rs , 2D' 1S ) 



's>0 



s>0 



Let us make some comments, (a) The limit of the height and the contour processes are the same 
up to the multiplicative time constant 2. This comes from the fact that vertices are visited once 
by the height process while the edges are crossed exactly twice by the contour process. 

(b) The definition of 7 through expectation (1) is only for pratical reasons. We have not 
found a simpler expression except for the case a± = . . . = a& = 1/6 where b is an integer greater 
than 1. In that case the Xj's are deterministic and 7 = 1 — 1/6. 

(c) The continuum random sin-tree whose 2D( 7 .) and 2D'^ ^ are resp. the left and the right 
height processes can be defined as follows: To any real s corresponds a vertex in the tree at 
height H s = l(_oo,o)( s ) 2-D_ 7S + l[o,oo)( s ) 2-D* s . Let s < s'. The youngest common ancestor of 
the vertices corresponding to s and s' is situated at height 

m(s, s') = ini{H u ; u G I(s, s')}, 
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where I(s, s') is taken as [s, s'] if ^ [s, s'] and as R\ [s, s'} otherwise. Thus, the distance between 
the vertices correponding to s and s' is 



We say that s and s' are equivalent if they correspond to the same vertex in the tree, i.e. 
d(s, s') = that is denoted by s ~ s' . We formally define the continuum random sin-tree as the 
quotient set T = R/ ~. Then d induces a metric on T that makes it be a (random) Polish space. 

We can show that the metric space (T, d) is a M-tree (see [6] for related results). Due to 
the Brownian nature of H, all fractal dimensions of T are a.s. equal to 2. A point a G T is 
said a branching point if the open set T \ {a} has more than two connected components and it 
corresponds to times at which H reaches a local minimum. Since all the local minima of H are 
distinct, all the branching points are binary, i.e. T \ {a} has three connected components. 

(d) Observe that the limiting tree T is symmetric since D and D' have the same distribution. 
An heuristic explanation is the following: arguments discussed at Section 3.2 imply that an 
unbalanced set of weights a breaks the symmetry of T e only if r e has branching points of order 
> 3 which does not happen to the limiting tree T that is binary. 

3 Proof of the main result 
3.1 Combinatorial results 

In this section e is fixed and for convenience of notation we drop the corresponding subscript 
in the random variables. Thus, we write W and r instead of W e and r e . As explained in the 
introduction, the linear interpolation of the process (|W n |;n > 0) can be viewed as the left 
contour process of a (Active) GWI-tree denoted by r and whose distribution is given by the 
following lemma. 

Lemma 3.1 The linear interpolation of (\W n \ ;n > 0) is distributed as the left contour process 
of a GWI(n,r)-tree where fi(k) = ud k , r(k,k) = fi(k — 1) and r(k,m) = 0,l<m<k,k>0. 

Proof : Let (£ n ;n > 1) be i.i.d. such that P(£ n = 1) = u and P(£„ = -1) = d. Set 50 = 
and S n = £i + • • • + £,n and define T_i as T_i := inf{n > : S n = —1} (with the convention 
inf0 = oo). Since the random walk S = (S n ;n > 0) a.s. drifts to +oo , -P(T_i = oo) > 0. By 
definition of W, (\W n \ ; n > 0) has the same distribution as S under P( • |T_i = oo). 

Let us denote by (T-°^;i > 0) the passage times to state 0: Tq ^ = and = inf{n > 
T,^ : S n = 0}, with the convention inf = oo. Set 



d(s, s') = H s + H s , - 2m(s, s'). 



We denote by £±, . . . , 



K = sup |i > : Tr' < oo j < oo a.s. 
£ki £k+i the excursions of S away from defined by 
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and by £r+i = (SL(o) , ;n>0). We first consider the tree r whose contour process is the linear 
interpolation of (S n ; < n < T_i — 1) under P( • |T_i < oo): 

Claim: r is a GW(/x)-tree. 

Proof: If L stands for the number of children of the root of r, then L + 1 is also the number of 
times S visits before T_i: L = sup{i > : T- ^ < X!_i}. By applying the Markov property 

at the stopping times X^'s, we show that P(L = 0) = d and that for any / > 1, conditional on 
the event {L = I ; T_i < oo}: 

• (a) £\, . . . ,£i are i.i.d. and they are distributed as £\ under P( • \£\{\) = 1 ; < oo). 
Moreover, the Markov property at time 1 implies that 

• (b) {£i{n + 1) - 1 ; < n < T 1 (0) - 2) under P( • |fi(l) = 1 ; T 1 (0) < oo) has the same 
distribution as (5 n ; < n < T_i — 1) under P( • |T_i < oo). 

Now observe that the contour processes of the subtrees 6\t, . . . , 6lt are the linear interpolations 
of (£i(n + 1) — 1 ; < n < Q — 2) , 1 < i < L. We deduce from (a) and (b) that r satisfies the 
two conditions of the definition of a GW-tree; its distribution is then the distribution of L under 
P( • |T_i < oo), which can be computed as follows: Observe first that 

{L = l; T_i <oo} = {f 1 (l) = l;T 1 (0) < oo ; . . . ; ^(1) = l;r/ 0) < oo ; £ l+1 (l) = -1}. 

Then, by (a): 

P(L = I ; T_i < oo) = cZP(5i(l) = 1 ; < oof. 

But (b) implies that P(£i(l) = l;^ < oo) = mP(T_i < oo). Thus, 

P(L = l; T_i < oo) = d(uP(T„i < oo)) fc 

and by summing over ^ we get P(T_i < oo) = <i/(l — uP(T_i < oo)) which implies that 
P(T_i < oo) = cZ/n. Finally we get 

P(L = Z|T_i < oo) = ud k = n(k), 

which achieves the proof of the claim. ■ 

Let us achieve the proof of the lemma: we now consider the tree r whose contour process 
is the linear interpolation of (\W n \ ;n > 0). To simplify notations, we identify this process to S 
under P( • \T-\ = oo). Then K + 1 is the number of children of the ancestor of r. First, observe 
that 

{K = k ; T_i = oo} = {£i(l) = 1; < oo ; ... ; £ k (l) = 1; T fc (0) < oo ; T fe ( °\ = oo}. (13) 
By applying the Markov property, we then show that conditional on {K = k ; T_i = oo} 
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• £i, . . . , £k+i are independent; 

• £\,...,£k are distributed as £\ under P( • \£\(X) = 1;^°^ < oo); 

• £k+i is distributed as £\ under P( • \T^ = oo) 

Now, by applying the Markov property at time 1 we see that 

(£i(n + 1) - 1; n > 0) under P( • \T^ 0) = oo) ( '= } S under P( • |T_i = oo). (14) 

Observe that the contour processes of the subtrees 6if, . . . , 0k+it are the linear interpolations 
of (£i(n + 1) — 1 ; < n < Q — 2) , 1 < i < K + 1. Deduce from (b), from the previous claim and 
from (14) that conditional on {K = k ; T_i = oo}, the subtrees 0\t, . . . ,6kf are k independent 
GW(«)-trees and that 9k+iT is distributed as f. It implies that f satisfies the two conditions of 
the definition of GW /-trees. Since the infinite subtree is 9k+iT, r is a G WI(u, r)-tree with 

r{k + l,m) = , 1 < m < k + 1 and r(k + 1, k + 1) = P(K = fc|T_i = oo) , k > 0, 

which can be computed as follows: Deduce from (13) and the Markov property 

P(K = k ; T_i = oo) = P(5i(l) = 1 ; Tj® < oo) k P(T} 0) = oo). 

Now observe that P(Tj® = oo) = uP(T_i = oo) and that P(£i(l) = 1 ; T 1 (0) < oo) = d. Thus 
P(K = fc|T_i = oo) = /J,(k), k > 0, which achieves the proof of the lemma. ■ 

Observe that r is completely asymmetric i.e. it has no vertices at the right hand of its infinite 
line of descent. Note also that its immigration distribution v is equal to fx. In what follows, we 
explain how to recover the full range {W n ; n > 0} from r. To that end we need to label r by 
random marks in N* as explained in the introduction. Let us introduce some notation: the set 
T = (t ; (m u ,u € t)) is a N*-marked tree T if t G T and if m u £ N*, u G t. The m n 's are the 
marks of T. The set of N*-marked trees is denoted by Tpj* . We define the track of T as the 
mapping Try : t — > U defined as follows: Let u G t; if we denote by no = =^ ui ^ . . . =<! u n = u 
the ancestors of u, then we define Try (it) as the word m Ul ...m Un G U, with the convention 
Trr(0) = (observe that m plays no role in the definition of Tr^). 

Similarly we define marked forets as sets of the form F = (/; (m u , u G /)) where / G F and 
m u G N*. The set of marked forests is denoted by Fpj*. We define the track Trp of F exactly as 
we have defined the track of marked trees and we set for any u G F 

Ou(F) = (e u (f);(m uv ,vee u (f))) and [F] u = {[f] u ; {m v ,v G [/]„)) . 

Since the linear interpolation of the process (|W n |;n > 0) is the distance-from-the-root 
process of a (Active) particle exploring continuously r at unit speed from left to right r, we can 
associate with each vertex u G r \ {0} a unique time n(u) G N such that the (Active) particle 
climbs the edge ( u ,u) between times n(u) and n(u) + 1. Since |W n ( u ) +1 | = 1 + |W n ( u )|, we can 
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find fi u G N* such that the word W n / u )+i is written W n ( u )// U 6 U. We then define the random 
marked tree T as 

T = (t; (7Z u ,«€r)), 

where the mark of the root ~p is taken independent of W and distributed on N* in accordance 
with the set of weights a: P(/7 = i) = a^, i G N* The distribution of T is described by the 
elementary lemma whose proof is left to the reader. 

Lemma 3.2 Conditional on T, the marks (~p u ,u G r) are independent and distributed in accor- 
dance with a. Moreover, 

Tr T (r) = {W n ■ n > 0}. (15) 

As already explained in the introduction, to take the track of T is a procedure that can be 
broken up in two distinct sub-procedures: The first one "shuffles" r by putting its edges in a 
certain random order. The second one "shrinks" r by identifying some successive edges with 
respect to the new random order. Let us first specify what we mean by shuffling: Let t G T; we 
say that p = (p u , u G t) is a permutation of t if each p u is a permutation of the (possibly empty) 
set {1, . . . , k u (t)}. Let u G t be the word c\ . . . c n . We denote by Uk = c\ . . . Ck the fc-th ancestor 
of u. We define the word u p by p Uo (ci) ■ ■ •p Un _ 1 (c n ) G U if u ^ and by otherwise. We set 
t p = {vP; u G t}. Now, pick uniformly at random a permutation tt of t among the Y\ u& k u (t)\ 
possible ones. We define the shuffling of t as the random tree Sh(t) := t w . 

Remark 3.1 Shuffling a GW-tree does not change its distribution. It is also easy to check that 
Sh(r) is a GWI(/x, r')-tree with r' given by r'(k,j) = ud k ~ 1 /k, 1 < j < k. 

We would like to shuffle a N*-marked tree T = (t ; (m u , G t)) in accordance with the order of 
its marks in N*: for any permutation p of t, set T p = (t p ; (m uP ,u G t)) and observe that 

Tr TP (t p ) = Tr T (t). (16) 

Let 7r(T) = (ir u , u G t) be a random permutation of t such that the 7r n 's are mutually independent 
and ir u is picked uniformly at random among the permutations a of {1, . . . ,k u (t)} satisfying 

m ua (i) < m ury (2) < ... < m ua ( ku ^y 

We define the shuffling of T as Sh(T) := T w( - T \ By definition the mapping Tr Sh(T) : t 77 ^ -» U 
is increasing with respect to the lexicographical order: 

\fu,v G t^ T) , u < v => Tr Sh (T)H < Tr SHT )(v). (17) 

Observe that if any brothers in T have distinct marks, then n(T) is deterministic. Thus, 

has clearly not the same distribution as Sh(i). However when the marks m u , u G t are i.i.d. 

random variables, we can easily check that is distributed as Sh(£). Thus, if we set 

T = Sh(T) :=(r ; (^.uer)), 
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then, we deduce from the previous observation that 

Tv f (r) = {W n ; n > 0} , r ( '= } Sh(r) (18) 

and that 

Vu,uGr, u < v Tr^(u) < Tr f (v). (19) 

So, we first obtain r by shuffling the G Wl-tree f and then by identifying the edges of the resulting 
marked tree that have the same random marks. We now give estimates in Proposition 3.3 and 
in Proposition 3.4 on how much this edge identification does shrink r. Let us introduce some 
notations: with any marked forest F = (/; (m u ,u G /)) we associate a collection (Z V (F); u£D) 
of integers defined by 

Z V (F) = #{uef: Tr F (u) = v}. 

Some key estimates in the proof of Theorem 2.1 rely on a precise computation of the law of the 
Z v (F) , s when F is distributed as a GW-forest or a GWI-forest. From now on until the end 
of the paper all the G W or G Wl-forests that we consider share the same offspring distribution 
ji{k) = ud k , k > 0. We set for any % G N* and for any x G [0, 1] 



fix) :=J2ud k x k = and fi(x) := /(l - a t + a iX ). 



u 

ud"x" = — 

k>0 

For any v = mi . . . m n G U we also define 

fv - = fmi o . . . o f mn and a v := a mi . . . a mn , 

with fes = Id and a = 1. We adopt the following convention: to simplify notation, we 
do not distinguish constants in inequalities and we denote them in a generic way by a symbol 
Ka,/3,... meaning that we bound by a positive constant that only depends on parameters a, j3, . . . 
etc. 

We first describe the law of (Z V [T)\ v G U) with T = ((p ; u G <p)), where <p = (ri, . . . , 77) 
is a forest of I i.i.d GW(/i)-trees and where conditional on ip the marks (fJ, u ,u G tp) are taken 
mutually independent and distributed in accordance with a. 

Proposition 3.3 (i) For any v,w G U, 



E 



JMD\z v (F)] =f w {x) z ^). 



(ii) Moreover for any v = mi . . . m n G U, 

l-fv(l~x) = A{v)x X +B{v) with l/B(v)=a v (d/uT 

and 

. . . U 1 /U\ 2 1 1 

A(v) = l + - + - + ...+ 



d CL mi \dJ tt mi CL m2 \d/ Ami ■ • • flm„-i 
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(Hi) For any positive integer p, 



E 



j'6! 



V 



d/u 



Proof : We first show (i) whose proof reduces to the "Z = 1" case by an immediate independence 
argument. Let us take T = T\ = (t\ ; (fi u , u G ri)) and dGD. Consider the set C v of the vertices 
u G 7~i satisfying Tr^w) = f. We denote by u\ < . . . < uz v it 1 ) the elements of C v listed in 
the lexicographical order. As a consequence of Remark 2.1, we see that conditional on C v the 
marked trees (9 Ui (Ti), 1 < i < Z v (T\j) are i.i.d. marked trees distributed as T\. Observe next 
that for any i»eU 

Z„(Ti) 

^(Ti) = £ #{u G ^(Ti) : Tr^ (Tl) ( U ) = w}. 



i=l 



So we get 



E 



x^Wi) | z w (Ti; 



E 



„Z w (Ti) 



Then it remains to prove: E[x z ™ ( -' ri )] = f w (x), which follows from iterating the previous identity 
and from the easy observation: Efx^ 7 ^] = fi(x), i > 1. 

The proof of (ii) is a simple recurrence. Let us prove (Hi): for any positive integer p and any 
v = mi . . . m n G U, we deduce from (ii) the following inequality 



P 



' B(v) \B(v) 



p-i 



'J\ \v\ 

< (^) (l-a + ) 1 - p , 



(20) 
(21) 



where we have set a + = maxj>i <2j < 1. For any integer i we denote by (x)i the factorial 
polynomial x(x — 1) . . . (x — i + 1) (with the convention: (x)o = 1). Check recursively that for 
any l,p > 1 and any h G C°°(R,M), 



dPh 1 
dxP 



(22) 



where the Qj,pS are j-homogeneous polynomials with N- valued coefficients that only depend on 
j and p. Deduce from (21) that for any v G U, 



E[(Z V (F)) P ] = 



dxP 
v 



(23) 
(24) 
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7=1 



Q jlP (l!,2!(l 



a + )- 1 ,...,p!(l-a + ) 1 ^) 



(25) 



< K aiP Fa. 




(26) 




(27) 



'V 




l-d/u' 



1 



(28) 



We need similar results for GWI-forests. Let r be some fixed repartition probability measure. 
We denote by v the corresponding immigration distribution given by v(k — 1) = Yli<j<k r (^'i) 
, k > 1. For any x G [0, 1] and any i G N* we write 



Let = (ifo; (fi u , u G (po)) be a random marked GWI-forest whose distribution is characterized 
as follows: ipo = (to, t\, . . . 77), the Tj's are mutually independent, t±,...,ti are i.i.d. GW(/i)- 
trees, To is a GWI(/U, r)-tree and conditional on (po the marks fj, u are i.i.d. random variables 
distributed in accordance with a. For convenience of notation, we set 



We also set Sp = {v *i , i G N* \ {/U u * } , n > 0} and we define 5 as the a-field generated by the 
random variables (// u * ;n > 0) and (Z^(^o) \w G Sp). 

Proposition 3.4 (i) Conditional on S, the collection of the V-indexed processes 

( (Zwv^q); v G U) ; w G Sp) are mutually independent. Moreover, for any w G Sp, i/ie process 

(Zwv^o)-^ G U) on/?/ depends on S through Z w {Tq). More precisely, 



where T = (99; (/-i u ,u G 93)), where ip is a sequence of I i.i.d. GW([i)-trees and where conditional 
on if the marks (fiu,u G (p) are i.i.d. distributed in accordance with a. 




u* n = u* n {<po) and v* n 



TWO , n>0. 



(Z WV (F ); v G U) under P( • |w G Sp ; Z w (^ ) = = (^C^ v G U) 
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(ii) For any p > 1, any n > 0, 



and for any i G N* ; 



E [Z v * n (T y] < K^ p (I + If max g^(lf 



E [Z^of] < K^ai (I + If max g^(lf, 

0<j<p 



with the convention = g. 
(Hi) For any p > 1 and any n > 0, 



E 



£A «r^ (, + 1) 'o?|f/ (,, ( 1) ' 



Proof : Set for any w G Sp, C w — \u G (fo '■ Tic^p Q (u) — kj}. Then by definition, Z w (J~q) — ^£ w . 
Check that 

Mu^v! G C w : u, v! £ ^oo(vo) and u A u ^ {u, u'}. 
weSp 

These two observations combined with Remark 2.1 imply that conditional on <S the marked trees 
Gui^o), u G UweSp'^w are i-i-d. marked GW(//)-trees with independent marks distributed in 
accordance with a. This implies (i) thanks to the following equality valid for any w G Sp and 

any v G U: 

Let us prove (ii): Suppose that the word «* is written l\ . . . l n G U for some nonnegative integers 
h,. . . ,l n . Consider u G (po such that Trjr (u) = -u*. There are three cases: 

• If \u A u* | = n then ix = it* . 

• If |it A it* | = —1, then itA-u* is the Active root (—1,0). Thus, the ancestor 0o of the 
sin-tree tq is not an ancestor of u. It implies 

I 

#{uG<A) : |uA<| = -l and Tr To (u) = v* n } = ^ Z v . (e 0j (Fo)) . 

i=i 

• If iz A it* = it*, with < < n, we can find some j G {1, . . . , (<£>o)} with j ^ l^+i and 
some u'eU such that 

u = u* k ju' , Hulj = Hu* k+1 and Tr e ^ o) (v!) = w* k+l , 
where w k+1 G U stands for the word /^«* +2 • • • G U, with the convention u;* = 0. 
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Now, set for any < k < n, 

E k +i = {j € {1, • . . , k u *(ip )} : j ^ l k+1 and n u *j = 

The combination of the three preceding cases implies that 

l n-l 

^*(^o) = i + E^<(M^o))+E E #Ve^W:% ra M = <i}. 

j=l k=0j&E k+1 

Set Kk = #Ek , 1 < k < n, kq = I and Wq = u*. Then, by (i) we get 



E 



- Z <(^o) | (/c fc+1 ,^ u . +i ) <fc<n-l 



n-l 



fc=0 

n 



(29) 
(30) 



fc=0 



It also follows from the previous observations that are mutually independent with the 

same distribution specified by 



E[x Kl ] = E [g^{x)\ = J2 a l9ai (x). 
Prom Proposition 3.3 (ii) we get a.s. 



(31) 



ieN* 



f wi (l + z) = l + 



B(wl) V B(w* k ) 



and since 



A(u>£) . <i .|,„*i d , 



B(w k ) ,i 



/^♦(1 + z) has a.s. a power serie expansion with a radius of convergence greater than 1 — a + > 0. 
Then, for any \z\ < 1 — a+ we can write 



i & p f1 
pi dzP 



f wl (l + zT* = l + Y,D^zV with D$ 



(k) 



p>i 



(!)• 



Deduce from (26) 



i #73 

- p\ dzP 



(1) < K p ^K p k a K (J?J 
— ^P.a K k "+ • 



(32) 
(33) 
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Then observe that 



where 



k=0 p>l 



D r . 



E E n 

Pc{o,.,n} E fc£P fe=p 



Set Do = 1 and deduce from (30) 



E 



(X» (T )) p I (Kfc,/i u *)o<fe<n =p! {Dp + D, p -i 



(34) 



Use (33) and the independence of the Kj's to get 



E [-Dp] < E E II^aE[ 

•PC{0,...,n}E fce7 p<7fc=P fc eP 

<?k>l 



If C {0, . . . , n} and SfceP qk = P with qk > I, k e V, then #7^ < p and qk <P for any k eV. 
Thus, 

IIe[k?] <(/ + ir(lVE[ K ?]f 

fce-p 

since «i, . . . , K n are identically distributed and kq = I. Deduce from (31): 

1 V EK] < K a , p max (l,</(l), . . . , g {p) (1)) . 

Thus, 



since 



V[D p }<K a , p (l + iymaxgti\iy 

0<3<P 



e e n < _fc < ^ u - « + r p - 

7>C{0,...,n}£ fce p9fc=P fceP 
<?fe>l 



(35) 



Then by (34) and an easy argument 



E [Z^of] < K a>p (I + If max ^ (If. 



(36) 



To achieve the proof of (ii), we set C v * = {u G (fo \ {u* n } : Trjr (u) = i>*}. Recall 
that any u G -Cd* has offspring distribution \x and that it^j has offspring distribution i/. Since 
Z v * = 1 + #£ v * , we get for any i G N* , 



E 



n 1 r <*n+l 



L {t* u * =*} r , \Z * —1 I \ 

"+ 1 fi(xr< l gi(x). 
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Thus, 



E 



x z <> | Z K = k + ll = (1 - ai + <nx)fi(x) k gi(x). 



By differentiating p times at x = 1 we get 



E 



dxP 



(Z v *.) I Z v * = k + 1 
Now observe that for any q >0,^(l) = a^)(l) and 



dP ^(D + «^(i) 



dx« 



■(1) = (oid/«)«(fc + q - l) q , 



by a simple computation. Thus, 



dxP w ^ 

q=0 



q\(p-q)\ 



a p - q 9 {p - q \l)(aid/uy(k + q-l\ 



Consequently, 



E 



< K p a P (k+p) p max g®(l). 



Since p > 1 and by (36) we get 

which easily implies the second inequality of Lemma 3.4 (ii). 
We now prove (Hi): first observe that the decomposition 

E Z V ([J 7 ] K ) P = ei + e 2 + e 3 

holds with 



e l = E £^«w([.Fo]u') P . 



|u)|<n 



£2= Yl Z «je([^o]u') p and e 3 = ^ Z <t) ([^o]<) p - 
0<fc<n «eu 

Note for any v G U and for any w E Sp such that \w\ < n that ^^([^bju* ) = Z wv (Tq). Then 
by Proposition 3.3 (i) and Proposition 3.4 (Hi) 



e [ ei 1 5] < k^ p y, < k & , p (i - d/u)- 1 E Z ^r 



weSp 

\w\<n 



d/u 



(37) 
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We then deduce from the second inequality of Lemma 3.4 (ii) 

E M < K a , p (I + If max ff 0)(lf^±L. (38) 
o<j<p l — d/u 

Observe next that Z v * ([Folu^ ) = Z v *{Tq). Then by the first inequality of (ii), we get 

E [e 2 ] <K ap n(l + l) p max <? (j) (l) p . (39) 

0<j<P 

To bound E[e3], note that conditional on Z v * ([^b]u* ) = ^ the process (^^([.T^Im* ) ; v G U) 
is distributed as (Z V (T)\ v G U) where = (ip;(n u ,u G 99)), where </? is a sequence of I 
independent GW(/u)-trees and where conditional on (p the marks (lJi u ,u G <£>) are i.i.d. random 
variables distributed in accordance with a. Thus, by Proposition 3.3 : 

E[e3|^([^ok)]<^a /f^ )P . 
Now observe that Z V ^(T§) = Z v ^ ([.Fok* ) and use (36) to get 

«H<^«^^ (40) 
Then, (w) follows by adding (38), (39) and (40). ■ 
3.2 Proof of Theorem 2.1. 

Let us first explain why Theorem 2.1 reduces to a convergence for finite trees: we restore e in 
the random variables: T e = (T t ;(~p u ,u G r e )) and 7^ = Sh(T e ) = (t € ;(J1 u ,u G r e )). For any 
positive real number x we set x e = \x/e\ and we define Cx,e = sup{n > : \W*\ < x t }. As 
explained in the introduction, we associate a unique finite ordered rooted tree rf with the subtree 
{W^, < n < Cx,t} C U. Observe that in general rf 7^ [t £ ] m » ( Te ), however rf and r e coincide up 
to level x e : 

[<k = [r e ]x.. (41) 



The following proposition asserts that the convergence of r e is equivalent to the convergence of 
the rf s for all x > 0. For convenience of notation, we set ( X:6 = 2e 2 #rf and 

H s (x,e) = el[0,C E , e )(») fl 'L»/2e 2 j( 7 f) aIld ^( X ' e ) = 6 1 [0,C,e-2e 2 ] (*) Gs/e2 «)• 

We also define the limiting process by 

= l [0 , aa] (a)D a + l [(T , !00) ( S )^ Cx _ s)+ , 
where = + c* with da; (resp. a*) = sup{s > : D s (resp. D* s ) < x}. 
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Proposition 3.5 Theorem 2.1 is implied by any of the following equivalent convergences 

(i) Vx > , C(x, e) ( ^ ) (2D$; s > o) 

(ii) Vx > , H(x, e) ( ^ . (2Dg); s > o) . 

Proof: The proof of (ii) => (i) can be copied from the proof of Theorem 2.4.1 [5]. It relies on 
formula (5) that makes the contour process of a finite ordered rooted tree an explicit functional of 
the corresponding height process. Since (5) also holds for contour processes of sin-trees, similar 
arguments work to show that Theorem 2.1 (ii) implies Theorem 2.1 (i). Let us prove that 
Proposition 3.5 (i) implies Proposition 3.5 (ii): Recall from (5) that 

H n (T*) = C 2n -H n {T?)(Te)- 

So, if we denote by 5(e) the maximal height of rf we get 

sup \H n (r x )-C 2n (rf)\< max |C„(7f)-C n ,(7f)|, 

n<#Tf \n-n'\<S{e) 
which implies after scaling 

sup I H s (x,e) — C s (x,e) I < max I C s (x,e) — C s '(x,e) I . 

s<( x , t \s-s'\<e^S(e) 

Proposition 3.5 (i) implies that eS(e) converges in distribution to the supremum of that is 
a.s. finite. Thus, the right member of the latter inequality converges to zero in probability and 
Proposition 3.5 (ii) follows. A similar argument show that Theorem 2.1 (i) implies Theorem 2.1 
(ii). Now, the proof will be achieved if we show that Proposition 3.5 (ii) implies Theorem 2.1 
(ii): Assume that Proposition 3.5 (ii) is true and deduce from (41) that 

^H lsAexf/2t 2 } (T e ) , e J ff[ , iAe . ie/2e2J (r e ))^ o ^ > (2£> 7(sA e.) , 2D 9 l(sAe .^ ^, (42) 

where e X;t = inf{n > : H n (r x ) > x t } and e x = inf{s > : D s > x} with similar definitions 
for e* e and e* . Observe that Proposition 3.5 (ii) implies for any x > that (e^, e* J converges 
in distribution to (e^, e*). Since e x and e* a.s. go to infinity with x, we then get for any M > 0, 

lim lim sup P (e Xj£ < M ; e' x e < M) = 0, 

which implies Theorem 2.1 (ii) by (42) and by standard arguments. ■ 
We define ~T X = \r e ] u * p= e ) and rf = [T e ] u * (r e ) and we also set 

^6 = \7e] Ke (r e ) = (r x t ;(jX u ,ueT x t )) and f x = [T e ] M * £ {%) = (t x ; (jl u , u e rf )) . 
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By definition, #rf = #rf = Cx,e- Deduce from (18) and (19) 

Tr fe (T?) = {W<;0<n<( x , £ } , 5f ( '= } Sh(r^) (43) 

and 

Vn,wGrf : it < v Tr^ (it) < Tr^- (u) . (44) 
By Proposition 3.5, Theorem 2.1 reduces to prove that for any x > 0: 

The first step of the proof of (45) is a limit theorem for rf : let us set for any s 6 [0, oo) 

H s (x,e) =€l[ 0> 2 e 3#7?)(s)fl'L*/2e 2 j( 5 f) and = el [0,2 e 2(#ff-l)](s)C S A2(T?)- 

Lemma 3.6 

(i) Vx > , e) (d) Q ) s > o) 

(ii) Vx > , e) (d) Q > (2D^; s > o) 
Proof : Deduce from (9) 

fl n (r e ) = n-p e (n) + fT pe(n) (/(r e )), n > 0. (46) 

Recall that /(r £ ) stands for the forest composed by the bushes rooted at the left hand of the 
infinite line of descent of r e and that p e (n) is given by p e (n) = inf{p > : n e (p) > n} where 

n e (p) =p + inf{k > : L k (r e ) > - inf Wj(f(%))} 

with L n (r e ) = (h(r t ) — 1) + . . . + (l n (%) - 1) , R > 1 and Lo(r e ) = 0. By Remark 3.1, % is a 
GWI(/x,r')-tree with r'(fc,Z) = ud k ~ 1 /k, 1 <l <k. Thus by Remark 2.4: 

— the two forests (/(r e ), f(r')) are independent of (L(r e ), L(t*)); 

— /(r e ) and /(r*) are two mutually independent sequence of i.i.d GW(fx) -trees; 

— (L(r e ), L(t°)) is a N x N-valued random walk whose jump distribution is given by 

P (L n+ i(r e ) - L n (r e ) = I ; L n+1 (r e *) - L n (5?) = /') = - - * - ^ 
Check first that E[L n (r e )] = E[L n (r*)] = nd/2u, which implies 

(eL s/e (r e ) , eL s/e (?;))^ — ^ (s/2 , s/2) s > • (47) 
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Next, we need to prove the joint convergence of (e i?|s/2e 2 J ((^e))> 6 V[s/2e 2 \ (/(^e))) : We know 
from Remark 2.2 that (V^(/(r e )); p > 0) is a random walk with jump distribution given by 
p(k) = ud k+1 , k > — 1. An elementary computation implies for any A £ IR that 

E [exp (iAe V Ls/2e2J (/(r e )))] = exp (-— - 2i\s\ + o(l) 
and by standard arguments 

{ev [s/2 ^u(m s >_o-^r B{ - 2) (48) 

(see for instance Theorem 2.7 [19]). We then use Theorem 2.3.1 [5] that asserts that under (48) 
the following joint convergence 

(e# Ls/2e2J (/(?,)) , eV Vs/2 ^{f{r e ))) s ^ -£U ( 2 (i?(- 2 ) - 1^) , 2?(- 2 >) (49) 

holds provided that for any 5 > 0, 

hminf(/ L5AJ (0)) LlAJ >0 (50) 

(recall that f n is recursively defined by /„ = f n -i /)• Check that 

f , s _ u 1 - Cu/cQ n - x(l - (u/d)" -1 ) 
MXj ~ d 1 - (u/d) n+1 - x (1 - (u/d) n ) ' 

Then, 

J^ (/L^j(0)) Ll/eJ =exp(-^) > 
and (49) follows from (50). Recall notation a from Section 2.2 and observe that 
ea([s/2e 2 ]) = inf{ S ' > : eL Li , /eJ (r e ) > - inf eV 2 j(/^))}' 

Deduce from (7) that: 

2e 2 n e (Ls/2e 2 J) = 2e 2 Ls/2e 2 J +2e 2 a(Ls/2e 2 J). 

Then by (47) and (49) 

(ea([ S /2e 2 \)) s>Q ^-^-2I^ and (2e 2 n t ([s/2e\) ; s > 0) ( ^ (s ; s > 0). 



Thus, (2e 2 p e ( [s/2e\ ) ; s > 0) ^— > (s ; s > 0) and (10) combined with the convergence of p e and 
(46) imply 
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The joint convergence (47) combined with the independence of f(r e ) and f(r') also implies 

(etf L . /2e2J (? e ) , eif L . /2e2J (?;))^ — (2D , 2D') . 

Use (5) and arguments similar to those used in the proof of Theorem 2.4.1 [5] to get 

[eC s/ Ar.) , e ^>(^)) s>o (2D , 2D*) . (51) 

Set <x X)£ = sup{s > : C s (r e ) < x e } and define a* e in a similar way. Recall notations a x , a' 
and D^ introduced before Proposition 3.5 and deduce from (51) that 

It easily implies Lemma 3.6 (i) by (12). Then, argue exactly as in the proof of Proposition 3.5 
to deduce Lemma 3.6 (ii) from Lemma 3.6 (i). M 

We now have to prove Proposition 3.5 (ii). In one part of the proof we adapt Aldous' 
approach (Theorem 20 [3]) and we get estimates for the tree rf reduced at certain random 
times. The main technical difficulty is Lemma 3.7 that asserts that these random times are 
asymptotically uniformly distributed. Let us first define these random times: Let {Ui\i> 1) be 
a sequence of i.i.d. random variables independent of W e and uniformly distributed on (0, 1). Let 
uq = < u\ < . . . < u#fjF-i be the vertices of rf listed in the lexicographical order. We set 

Ui(x, e) = u Lw . #?? j and Vi(x, e) = Tr^ (Ui(x, e)) € U. 
Then Vi(x, e) G {W^ ; < n < Cx,e} and the row of the corresponding vertex in rf is given by 

Vi(x,e)= ^2 1 {z v (f*)>o}- 

v<Vi(x,e) 

The key argument is the following Lemma that is proved in the next section. 
Lemma 3.7 For any i > 1, the following convergence holds in probability: 

^{vi(x,e)-^Ui#T^j —^0. 

From now on until the end of the section we assume that Lemma 3.7 is true and we prove 
Proposition 3.5 (ii): Fix x > and set for any 5 > 0: 

u) (H(x,e),5) = sup{\H s (x,e) — H s /(x,e)\ ; Is — s'l < <5}. 



28 



T. Duquesne. 



Limit of random walks on trees. 



We first prove tightness for H(x,e), e > 0: By a standard criterion (see for instance Corollary 
3.7.4 [7]) we only need to prove 

(Tl) lim liminfP ( supH s (x,e) < M ) = 1 

and for any rj > 0, 

(T2) limlimsupP(u;(F(x,e),<5) > rj) = 0. 
Proof of (Tl) : Note that the mapping Tr preserves height. So, we get 

supH s (x,e) = e sup{Tr^ (n) : u G rf } = sup eiTi s / 2e 2i (rf ) 

s>0 e 0<s<2e 2 #?f 

which is a tight family of random variables by Lemma 3.6. ■ 

Proof of (T2) : Let k be a positive integer and p be a permutation of {1,. . . ,k} such that 
V p m(x, e) < . . . < V p f k \(x, e) in U. It implies 

V p (Q){x,e) < V p (i){x,e) < ... < V p{k ){x,e) < V p{k+1 )(x,e) 

where we set = V p ^(x,e) and #rf = V p ( k+ i)(x,e). We first need to get an upperbound for 
the quantities qi defined for any < i < k by 

qi = sup | |#„(T,f ) - % p( . )(X)e) «) ; V p(i) (x, e) < n < F p(m) (x, e) J . 

Observe that qi can be rewritten 

qi = sup 1 1 M - |V^ (i) (x,e)| | ; u G Tr^(rf) and V^ (i) (x,e) < t> < V^ (i+1) (x, e)| . (52) 

Set 

w (k,x,e) = max % , wi(k,x,e) = max | |V^ i+1) (x,e)| - \ V p{i )(x,e) \ | , 

0<i<K 0<i<k 

and 

A(&,x,e) = max d (v , {0, Vi(x, e), . . . , V k (x, e)}) . 

(52) easily implies 

u>o(fc, x, e) < w±(k, x, e) + A(/c, x, e). (53) 
Since Tr preserves height, we get for any i > 1, 

|Fi(x,e)| = |C/i(x,e)| = fl- LM . #5?J (7f). 

Then by Lemma 3.6 we get the following convergence in distribution 



ewi(k,x,e) > max 

e^O 0<i<k 
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where = £/( ) < U(X) < ■ ■ ■ < U(k) < M(k+i) = 1 denotes the increasing re-ordering of 
{0,l,Wi,...,W fc }. Thus, 



Vr/ > , lim limsupP (e x, e) > r/) =0. (54) 



We next want to prove 

Vr/ > , lim lim sup P (eA(k, x, e) > r?) =0. (55) 

k— >oo 

To that end, observe that for any u,u' € rf , d(Tr~. x (u),Trj. x (u')) < d(u,u'). Then if we set 
A'(/c, x, e) = max d (u ; {0, ?7i(x, e), . . . , ?7fc(x, e)}) , 

we get 

A(k,x,e) < A'(k,x,e) (56) 

and we control A'(k,x,e) thanks to Lemma 3.6 (the following argument is directly inspired 
from the proof of Theorem 20 [3]): With any I G {0, . . . , #rf — 1} we associate the index 
£ {0, ...,k + 1} such that is the smallest element y € {0, 1, U±, . . . ,Uk} such that 
I < [y#r*\ . Check that 

A' (iw) < q m„ f («,(??) + H Vm „^) - 2 iQs mf #??J *,<??)) . 

Lemma 3.6 implies that the right member of the previous inequality converges in distribution to 

sup (d& + D\f , - 2 inf D^A , (57) 

where we denote by U^Cx the smallest element y G {QxMiCx-, ■ ■ ■ MkQx} such that s < y (recall 
that ( x stands for the lifetime of the process as defined before Proposition 3.5). We easily 
check that (57) converges to in probability when k goes to infinity since 



sup ((xUi(s) ~ s) < max Uu+u - Uu\ > 

0<s<C x 0<i<k k^oo 

in probability. Thus, it implies (55) by (56) . Finally, as a consequence of (53), (54) and (55) we 
get 

Vr/ > , lim lim sup P (e u>o(k, x, e) > r/) =0. (58) 

k^oo e-+0 

Then, check that on the event 

E(k, x, e, S) = | min^ e 2 (V p (j+i) (x, e) - V p ^ (x, e)) > 5 j 



30 



T. Duquesne. 



Limit of random walks on trees. 



the following inequality holds a.s.: 

u(H(x,e),S) < Swo(k,x,e). (59) 

Use Lemma 3.7 to get 



in distribution. Thus, 



Vfc>l, limliminf P (E(k,x,e,S)) = 1 

5^0 e^O 



Easy arguments combined with (58) and (59) achieve the proof of (T2) and at the same time 
the tightness for H(x, e), e > 0. ■ 

It remains to prove that (2D^$ ; s > 0) is the only possible weak limit for the processes 
H(x, e), e > 0. Tightness for the H(x, e)'s, e > and Lemma 3.6 imply that the joint distributions 
of (H(x,e) , 2e 2 #rf) , e > are tight. Assume that along a subsequence e p — > the following 
joint convergence 

holds for some continuous process H' and some positive random variable Lemma 3.7 implies 
(H(x, e p ) ; 2e 2 p #r* p ; 2e p v V 1 (x, e p ) , . . . , 2e 2 p V k {x, e p )) 

-^- r {H'-X\U 1 (,'h,...Mh) 
where the UiS are chosen independent of (H 1 , £'). Since Tr preserves height, we get for any i > 1 

Hv^eJO = \Vi{x,e p )\ = \Ui(x,e p )\ = H [u . #% }(t? p ). 
Then Lemmas 3.6 and 3.7 imply for any k > 1 



(^FiO^)^ £ p) > • • • > H 2eiv k (x,e p )( x > e P ) ! 2e p#< P 5 2eJVi(a;, e p ) , . . . , 2e 2 p V k (x, e p 

( 2 <i , • • • , 2^ ; O ; WiCx/7 , • • • , WkCr/7) 



in distribution. Consequently, 



( H Uc>h . • • • . ; C ; W1C/7 , • • • , w fc c77) 

( '= } ( 2j d£1 , . . . , 2^ ; G ; WiC/7 , • • • , WfcC/7) • 
It implies (i?^ ; s > 0) ^ = (2D^s ; s > 0), which achieves the proof of (45). I 
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3.3 Proof of Lemma 3.7 

We introduce the notation 

Ui(x,e)= Yl Z v(V) 

v<Vi(x,e) 

and we first prove the following convergence in probability: 

e 2 ^(x.eJ-^Tf) -+0. 



(60) 



Proof: Deduce from (44): 

{u G 5f : Tr^(u) < F;(x,e)} C {u € 5f : u < ?7j(x,e)} c{«£ 5f : Tr^(n) < V$(x,e)} 
which implies 

< Ui{x,e) - [U t #r?\ < Z vdx>e) (f e x ). (61) 
Then observe that for any »£ll, 

_ ^(^) 



P(Fi(x,e) =v 
Thus, (61) and Cauchy-Schwarz inequality imply 

E[\Ui(x,e)-Ui#T?\] < 1 + E 



Since #rf = £„ eU ^(T/), we get £ weU Z„(7;*) 2 < (#rf) 2 and 



1/2 



E 



1/2 



1/2 



(62) 



E [|J7<(o;,c) -Z^#Tf |] <1 + E 
Remark 3.1 and (43) imply that r e is a GWI-tree with immigration distribution v = /j,, so that 

= = and gU){1) = Jl (ij ' j - L 



Then, by Proposition 3.4 (ra) 



E 



E z ^) 



K a x e K a x . . .. 
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Thus, 



E [e 2 | Ui(x, e) - Z^#rf |] < K a , x e(l + o(l)) 



and (60) follows. 

Then, Lemma 3.7 is a consequence of the convergence in probability: 

0. 



e 2 ( Vi(a;,e) - - Ui(x,e) 



(63) 



Proof of (63): We need several preliminary estimates (Lemmas 3.8 and 3.9) whose proofs rely 
on Propositions 3.3 and 3.4. We first consider a random marked GW-forest with I elements 
T e = (<f e ; (fj, u , u G ip e )) as defined at Proposition 3.3: recall that tp e = (ri, . . . rj) is a forest of I 
i.i.d GW(/x)-trees and that the marks (u u ,u G ip) are i.i.d. conditional on ip e , their conditional 
distribution being given by a. Set 7i >e = (n ; (fJ< u ,u G ri)) and define 



1/7, 



E [EagU 1 {Z„(T 1 , e )>0}] 

E [£„<=„ ^(T 1>e )] 



We also set 



P O^e) = Yl Z '"^) ~ 7e 1 {Z„(^ e )>0}- 

i>eu 



Lemma 3.8 First (i) lim e _>o7e = 7 and for any I > 1, 

(ii) 0<E[/3(JT e )] <K a l{l-l), 



(Hi) 



E 



1 - d/u 

Proof : Let us prove (i): First observe that 



and thus E [\(3 (F e )\] < KJ 2 (l - d/u)- 1 / 2 . 



1/7, 



£„ eU i-/„(o) 



£„gu /^(l) 

Then, Proposition 3.3 implies that for any v = m\ . . . m n G U 

^^(l) = ^a,(d/n)H = _J. 



d/u 



and also 



1 - fv(0) = a v 



d) ^ 
u 



u 



Qm.^, • • • Ojr, 



n-1 



Q"rrin • ■ ■ O-rno +••• + ! 
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Thus, we get 



1/7, = E 



i + x l - + ... + x 1 x 2 ...x G 

u 



(i-/,(o)) z < 



a„ < a * a, ; 



Thus, 



(64) 



where we recall that the sequence of random variables (X n ;n > 0) is distributed as specified 
after formula (1), and where G stands for an independent random variable whose distribution is 
given by P(G = n) = (1 — d/u)(d/u) n , n > 0. Since lim e ->o d/u = 1, an elementary argument 
implies 

lim l/ 7e = E [(1 + X 1 + X X X 2 + X ± X 2 X 3 + . . .r 1 ! = l/ 7 . 

e— >0 L 

Let us prove (ii): Deduce from Proposition 3.3 that 

E[/?(^)] = £//^l)- 7e (l-/„(0)'). 

The definition of 7e implies 

i 2 f v {\) - 7e (i - / w (o)) = £ ~ he(i - W)) = 0. 

i>GU v&J 

We then subtract this expression to (64) and we get 

E [(3 (F e )} = 7e £ /«(°)' " 1 + Ul " /«(0)) • 

i)GU 

Then, use the elementary inequality (1 — x) 1 — 1 + Ix < 1(1 — l)x 2 /2, x G [0, 1] to get 

E^f £ )]<^E(l-/,(0)) 2 
Deduce from the explicit computation of 1 — f v (0) recalled above that 

' d \*\v\ 



(65) 



£(i- a-(o)) 2 <J2< £ v-« m „<(i- a+r 1 

t)GU n>0 mi,...,m„6N* 

and (ii) follows from (i). 

It remains to prove (Hi): For convenience of notation, we simply write (5 and Z v instead of 
(3 (J 7 ,,) and Z v (F e )- Check that 



E [I5 2 ] = E [E x ] + E [£ 2 ] , 



(66) 
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where 



and 



El= /2 ( Zv ~ 7e 1 {Zv>0}) ( Z v> ~ le !{Z V ,>0}) 



v,v'ev 

vAv' £{v,v'} 



E 2 = ^2 ( Zv ~ 7e 1 {Zv>0}) ( Z v> ~ le 1{Z„,>0}) 

v,v'eu 

vAv' £{v,v'} 

(note that in the two sums all but a finite number of terms vanish). Define for any w £U 

flw = ^2 Zwv ~ 7e 1 {Z WV >0}- 

Ei can be rewritten as follows 

El — ^ ^ ^ ^ PwiPwj- 

Deduce from Proposition 3.3 that conditional on (Z w i,Z w j) (with i ^ j) the random variables 
fi Wi and (3 Wj are independent and distributed as (3 with resp. I = Z w i and I = Z w i. Use (ii) to 
get 

E [PwiPwj I { Z wii Z wj)} = E [/^ | Z w i]~E [P Wj I ^luj] 

< K a Z w i(Z w i — X)Z w j(Z w j — 1)- 

By Proposition 3.3 again, we get 

E [x^y^ | Z w ] = / (1 - Oi - Oj + a;x + a i2 /) z ™ . 

Recall that 



d k f\^_fdY(n + k-l)\ fMk 



dx k 



■(x) 



u J (n — 1)! 



/(*) 



(67) 



Then, 



E [Z W i(Z W i — l)Z w j(Z w j — 1) | Z w ] — afoj ^— ^ (Z^ + 3) 4 < 12Z^,. 



Thus, by Proposition 3.3 



E [Ex] <K a E 



wev 



I 4 



l-d/u 

We get a similar upper-bound for E [E2] by first noting that 

E 2 <2j2( Z w-lel{Z w >0})Pw. 

wev 



(68) 
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Apply Proposition 3.3 (i) and Lemma 3.8 (ii) to get 

E [(Z w - le l {Zw>0 }) 0w I Z w ] < K a Z 3 w . 
By Proposition 3.3 (Hi) again 

/ 3 



E [E 2 ] <K a B 



Then (Hi) follows from (66), (68) and (69). 



.wev 



l-d/u 



(69) 



We need similar estimates for a marked GWI(/it, r)-forest Fq :€ whose distribution is the same 
as in Proposition 3.4: recall that r is some fixed repartition probability measure on {(k, I) G 
N* x N* : I < k}. We denote by v the corresponding immigration probability measure given 
by v(k — 1) = Si<[<fc r (M) , k > 1 and we set g(r) = J2k>o u (k)r k . We define JF 0e as 
(<Po,e] (^«) u G 9?o,e)) where ipQ >e = (to, t±, . . . , r;), the Tj's are mutually independent, t\, . . . , t\ are 
i.i.d. GW(//)-trees, To is a GWl(p, r)-tree and conditional on ipQ <e the marks [x u are i.i.d. random 
variables distributed in accordance with a. Recall notations 

< = <(<A),e) , v* = Tr^ 0f (u* n ) , Sp = {v*i , i G N* \ {fi u *J,n> 0} 

and recall that S is the a-field generated by the random variables ; n > 0) and (^(^o.e) , w G 
Sp). For any n > 1 we also set 



We set 



Sp(n) = {to e Sp : | to | < n} U {<}. 
Ao ([^b,e]<) = ^^(P^ek*) ~ 7e 1 {^([^o, E ]<)>0}- 



Lemma 3.9 For any n>l, 



E 



sup 

AcSp(n) 



Au([-^b,e]tt* 



< if a n(l + 1) 2 (1 - d/n)" 1 / 2 max (l, g'(l) 2 , g"(l) 2 ) 



Proof : To simplify notation we write (5 W and Z w instead of (3 w ([Fo,e]u* ) and Z w ([!Fo te \ u * ). We 
also denote by E 5 the 5-conditional expectation. Let A C Sp(n). From Proposition 3.4 (i) 
we deduce that conditional on S the (/3 w ;w G Sp(n)) are independent random variables and 
that for each w G Sp(n), conditional on Z w = I, (3 W is distributed as the random variable (3(T e ) 
defined at Lemma 3.8. Apply Lemma 3.8 to get 



E 



weA 
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< K^l-d/u)- 1 ' 2 Z l- 

iuSSp(n) 



Next, use Proposition 3.4 (ii) to get 



E 



u>6Sp(n) 



n-1 



fc=0 «eN* 

< K a n(Z + l) 2 max(l,</(l)V'(l) 2 ), 
which achieves the proof of the lemma. 



We now comme back to the proof of (63) and we apply the previous results to the marked 
sin-tree % = (r e ; (]2 u ,u G r e )). For convenience of notation, we fix i and we set 

U = Ui(x, e) , V = Vi(x, e),U = U^x, e) , V = Vi(x, e). 

We keep the notations u* n = u*(r £ ), v* n = Tr^(n*), Sp, Sp(n) and <S. Recall that T e x = [%] u * 
and that for any »6D that is not a descendant of v* 



Zw I Te ) — %w ( 



(70) 



For convenience of notation, we set for any w € U 

Z w = Z w {f^j and W = Y1 Z ™'» ~ iMZwv > 0}. 



Since lim e ^o7e = 7, we only have to show 

e 2 (U- 7e V) =e 2 ^2z v - le l {Zv>0} 



v<V 



(71) 



in probability. To that end, we first introduce the random word 

W = max{u; € Sp(x e ) : w < V}, 

where the maximum is taken with respect to the lexicographical order on U. There are two cases: 

• If V $l [[0, v*^]], then we can find V' € U such that V = WV' and we set in that case 
A = {w € Sp(x e ) : w < W}. 

• If V € [[0, < e _i]], then we set A = {w G Sp(x e ) : w < W}. 
Then, check that U — ^y e V = e±(e) + e2(e) + 63(e) with 

e l( e ) = S Z «-7el{Z.>0}, 

«e[0,«; e _il 
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e2(e) = ^2 (3 W and e 3 (e) = ^{v^0,v* c _J} /2 Zwv ~ T« 1 {^>o}- 

The limit (71) is then implied by the following convergences 

e 2 E [| ei ( e )|]— -.0, (72) 

e 2 E[|e 2 ( £ )|] -.0, (73) 

e— >0 

6 4 E[e 3 (e) 2 ] — ^O. (74) 

Proof of (72) : Use Proposition 3.4 (u) with p = 1, I = 0, n = x £ — 1 and g(ar) = f(x) = 
u/ (1 — dx) to get 

x e — l 

E[|ei(€)|] < £ E + (z e - l) 7e 

i=0 

< K a d(x e - l)/« + (s e - l) 7e < K ai3; e" 1 

which obviously implies (72). ■ 

Proof of (73) : We use Lemma 3.9 with n = x e , I = and g(x) = f(x) = u/(l — dx) and thus 
g^(l)=j\(d/uy,toget 

E[|e 2 (e)|] < K a x e (l - d/u)- 1 / 2 < K^ x e~ z ' 2 

which implies (73). ■ 

Proof of (74) : It requires more complicated arguments. Let wq € U and let I be a positive 
integer. We define E(wq, I) as the event {W = wo ; Z m = I}. We first get an upper-bound for 

S(w ,l) = E[e 3 (e) 2 | E(w ,l)] . 

Let T = (<p; (fJ, u ,u G ip)) be a marked GW-forest with I elements as defined at Proposition 3.3. 
Pick uniformly at random a vertex U(T) in ip and define V(!F) € U by V(JF) = Tr^ilA^)). As 
a consequence of Propositions 3.3 (i) and 3.4 (i), we get the following identity 

(Z wov ,«6U;V) under P ( • | E(w , I)) ( '= } [Z V (T) ,«eU; V(T)) . (75) 
Let G be the function on F defined by 

G ([F\u{T)) = /2 z v{F) -7el{z4^-)>o}- 

v<V(T) 
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Then, (75) implies 



E 



E 



E G ^«) 2 



< (1 + 7,)E 



,u£<p 



(76) 
(77) 



since for any u £ if, 



E ^(^)<l+7e 



«<Tr ? («) 



We now estimate the right member of (77) thanks to (4): Recall the notation ip\, for a 
size-biased forest with I elements, i.e. a GWI(/x, r)-forest with I elements where r is given by 
r(k,j) = ud k /p, , 1 < j < k with /2 = X^o^M^) = d/u. Thus the corresponding immigration 
distribution is u(k) = (k + l)u 2 d k , k > and its generating function is g(r) = u 2 /(l — dr) 2 . 
Let us define the random marked GW/-forest T\, as (</?[, ; G y?b)) where conditional on ip\, the 
//„'s are i.i.d. with distribution a. Deduce from (4) that 



E 



rt>0 



Em- e [I g (ki<w))I] 



(78) 



Set as usual Vn(F\>) = l^^b ( u n(^i>)) an d observe for any n > 

= E - 7el{z„(^ b )>0} 

t '<<(^ r b) 

n-1 

= E^(^)+E%w-^ 

weAi, i=0 



where we have set 



and A|, = {w € Spt, : < u*^)} with 

s Pb = K_i(-n)* ;i€N*\ K;^)} . k ^ !>■ 

Then, 



E[|G([^] <W )|] <E 



E 


+ E 


me4 





n— 1 



E%(-^b) -7« 



i=0 
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Use Lemma 3.9 with g(x) = u 2 /(l — dx) 2 to get 



E 



E /w) 



< K a nl 2 (l-d/u)-V 2 
and use Proposition 3.4 (ii) with p = 1 and g(x) = u 2 /(l — cte) 2 to get 

n-l 



E 



i=0 



< K a n/. 



These inequalities imply 



avj ,i)<K a i 3 (i-d/uy 1 / 2 Y,^(-) <k> 

n>0 W 



(I -d/uf I 2 ' 



(79) 



We now comme back to the proof of (74): by (79), we get 

E M«) 2 ] = E ^ l)*(W = w ; Z w = l) 



i>i 



< 



K a 



(1 - d/uf/ 2 



E 



E ^o 1 {W=wo} 
u)o€Sp(rr e ) 



Then, set for any w$ € Sp(x e ), Cw = X^eu Z wqv and observe that P(P^ = wq \ S) = Cwo/# r f- 
Thus the previous inequality implies 



K a 



< 



K a 



(1 - d/«) 5 /2 



E 



w eSp(x € ) 

1/2 

E < e 

w eSp(x e ) 



C 2 

. . . f#7? ) 2 



i«oGSp(a; E 



(#rf) 



1/2 



But Etoo6Sp(x«) C < (# r f) 2 since 1 + x e + E woe s P (x e ) C™ = # r f • Then , use Proposition 3.4 
(ii) with p = 6, I = and g(x) = f(x) to get 



E [e 3 (e) 2 ] < K a - 



,1/2 



(1 - d/uf/ 2 



<K a/x e-'\ 



which implies (74). 
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